By preliminary amendment, Applicants correct certain erroneous references to the 
sequence identifiers and add sequence identifiers where appropriate throughout the 
specification. These corrections and additions are shown in detail in the marked-up version of 
the affected paragraphs attached hereto as "Appendix - 1. "Revisions Made to Specification and 
Claims." Claims 7-21 and 34-37 have been canceled, and claims 3-6 and 24, 29, 30 and 32 
have been amended. The changes to the amended claims are also shown in detail in Appendix 
- 1. A complete set of the pending claims, as presently amended, is attached as "Appendix - 2. 
Complete Set of Pending Claims, as Amended on September 17, 2001." 

In the "Notice to File Missing Parts of Nonprovisional Application," the Initial Patent 
Examiner stated that the application is informal for the reason that excessive text is contained in 
the drawings as originally filed. In reply, substitute sheets of the drawings are submitted 
containing Figures 1-12 to replace the set of drawings filed on April 10, 2001. As required by 
the Initial Patent Examiner, the Figures have been revised to omit certain text so as to better 
comply with 37 CFR § 1.84. Text deleted from the Figures has been moved to the 
corresponding figure descriptions in the "Brief Description of the Drawings" section of the 
Specification, or has been moved to the "Detailed Description" section where the particular 
Figure is discussed, or has been deleted because it is considered duplicative of description in 
the Specification as originally filed. A marked-up copy of the Figures is attached as "Appendix - 
3. Details of Drawing Revisions Submitted on September 17, 2001," showing the proposed 
changes by circling the new text in red ink and by striking out deleted text in red ink. If the 
consent of the Examiner is necessary for entry of these substitute sheets of drawings, 
Applicants respectfully request approval by the Examiner of the proposed corrections to the 
Figures and entry of the substitute sheets of drawings . 

Also enclosed is a computer diskette containing the initial computer readable form (CRF) 
of the Sequence Listing and a printed copy of the same, along with a statement that the content 
of the CRF and the original printed copy attached to the Application as filed is the same, and 
that the CRF and the attached printed copy are the same, and that no new matter is introduced 
thereby. 

Applicants respectfully submit that the amendments to the specification, claims and 
drawings presented herein are to correct typographical errors, to facilitate examination of the 
application by ensuring compliance with 37 CFR § 1.821-1.824 or 37 CFR § 1.75(c) or 37 CFR 
§ 1.84 prior to the first examination on the merits, or to comply with specific requirements stated 
in the "Notice to File Missing Parts." No new matter is introduced by way of these amendments . 

If any additional fees are required or if the fees submitted are in excess of that required, 
please charge or credit the appropriate fees to Conley, Rose & Tayon, P.C. Deposit Account No. 
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03-2769/1 889/00401/CGM. In the event that an extension of time is necessary in order for this 
submission to be deemed timely filed, please consider this a petition therefor and charge the 
associated fee to Conley, Rose & Tayon, P.C. Deposit Account No. 03-2769/1 889/00401/CGM. 

Respectfully submitted, 



Darol G. Mintz 0 



Carol 

PTO Reg. No. 38,561 
CONLEY, ROSE & TAYON 
P.O. Box 3267 
Houston, TX 77253-3267 
(713) 238-8000 (Phone) 
(713) 238-8008 (Fax) 
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APPENDIX - 1 

REVISIONS MADE TO REPLACEMENT PARAGRAPHS AND CLAIMS 

[0005] The invention relates to a compound that competitively inhibits binding of CSP rSEQ 
ID NO:41 to S. mutans histidine kinase fSEQ ID NO:61 . The compound is preferably a peptide 
or an antibody. The compoundjs preferably a derivative _of_ [SEQ ID. NO:2] fSEQ ID_NO:31 , a _. 
fragment of [SEQ I D NO:2] [SEQ ID NO:31 or a derivative of a fragment of [SEQ ID NO:2] 
fSEQ ID NQ:3V 

[0006] Schematic layout of the arrangement of the genetic locus encoding the signal peptide 
precursor (ComC) fSEQ ID NO:11 . the histidine kinase (ComD) fSEQ ID NQ:51 and the 
response regulator (ComE) fSEQ ID NO:71 . Note that this arrangement is different from other 
loci in related streptococci for the following reasons: a) The comC gene fSEQ ID NO:1l is 
transcribed from its own unique promoter, unlike the genes thus far described in other 
streptococci that are arranged in an operon-like cluster with the comC/DE genes being 
transcribed from a single promoter. 

[0007] b) The comC gene fSEQ ID NO:11 is separated by 148 nucleotides from the comD 
gene fSEQ ID NO:51 . 

[0008] Shows the nucleic acid molecule that is [SEQ ID NO 1] fSEQ ID NO:31 . In a preferred 
embodiment, the figure shows CSP (competence signal peptide fSEQ ID NO 31 ). Nucleotide 
sequence of the locus. Figure 2 also shows histidine kinase fSEQ ID NO 51 sequences and 
response regulator [SEQ ID NO 71 sequences. 

r0008.51 Figure 2A. S. mutans comC gene fSEQ ID NO:1l. Encodes a precursor to a 
signal peptide TSEQ ID NO:21. 

F0008.61 Figure 2B. S. mutans CSP encoding seguence TSEQ ID NO:31. Encodes a 
Competence Signal Peptide TSEQ ID NO:41. 

f0008.71 Figure 2C. S. mutans comD gene fSEQ ID NO:51. Encodes a protein that 
functions as a histidine kinase receptor fSEQ ID NO:61. 

r0008.81 Figure 2D. S. mutans comE gene TSEQ ID NO:71. Encodes a response 

regulator that activates transcription of a number of genes FSEQ ID NO:81. 

I[0009] Sequence of the deduced amino acid sequence of the signal peptide fSEQ ID NO 41 , 

histidine kinase fSEQ ID NO: 61 . and response regulator fSEQ ID NO: 81 . 

r0009.4l Figure 3A. S, mutans ComC protein (CSP Precursor) fSEQ ID NO:21. 

[0009.51 Figure 3B. S. mutans ComD protein (Histidine Kinase) fSEQ ID NO:61. 

[0009.61 Figure 3C. S, mutans ComE protein (Response Regulator) [SEQ ID NO:8J. 
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[0010] The deduced amino acid sequence of the signal peptide precursor in various strains 
and its predicted cleavage site , fo l low i ng tho g l yc i n e- g l ycino signa l The original peptide is 
expressed as a 46 amino acid peptide that is cleaved after the glycine-glycine residues to 
generate an active signal peptide . 

[001 1 ] Shows the peptide that is [SEQ ID NO:2] fSEQ-lP NO:41 . - The synthetic signal peptide - 

[SEQ ID NO: 161 that is effective at inducing competence, biofilm formation and acid tolerance in 
Streptococcus mutans. 

[0013] Table illustrating the effect of synthetic peptide on genetic competence in S. mutans 
cells. Induction of genetic transformation in Streptococcus mutans by synthetic 
competence stimulating peptide (SCSP). 

[0015] ComCDE local region fSEQ ID NO:231 . The ComC (first highlighted region: 
nucleoti de 101 to 241), ComD (second highlighted region: nucleotides 383 to 1708) and 
ComE (third highlighted region: nucleotides 1705 to 2457) proteins are highlighted. 

[0016] The comX DNA sequence fSEQ ID NO:221 . protein sequence fSEQ ID NO:231 . and 
the comX gene local region fSEQ ID NO:241 with 100bp included both upstream and 
downstream (promoter is upstream). 

r001 6.41 Figure 10A. S. mutans comX gene TSEQ ID NO:221. 

r0016.51 Figure 10B. S. mutans ComX protein fSEQ ID NO:251. 

r0016.61 Figure 10C. S. mutans comX gene local region TSEQ ID NO:261. 

[0017] The comA and comB nucleotide fSEQ ID NO:251 and TSEQ ID NO:271 and amino 

acid sequences fSEQ ID NO:261 and fSEQ ID NO:281 . ComA and ComB are the components 

of the CSP exporter. 



IUU1 T.51 

T0017.61 


i-igure ha. s. mutans comA aene FSEQ ID NO:271. 
Fiaure 11B. S. mutans ComA Drotein TSEQ ID NO:281. 


r0017.7l 


Fiaure 11C. S. mutans comB aene TSEQ ID NO:291. 


T0017.81 


Fiaure 11D. S. mutans ComB Drotein TSEQ ID NO:301. 



[0018] Illustrates the effect of synthetic peptide on acid resistance tolerance in S. mutans 
comC deficient cells. Addition of synthetic signal peptide (CSP) TSEQ ID NO:16l into the 
culture of the comC mutant restored the ability of the mutant to survive a low pH 
challenge when compared to the parent strain NG8. 
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[0020] We have identified a genetic locus in S. mutans consisting of three genes that encode: 
1) a peptide precursor fSEQ ID NO:21 that is processed during export into a secreted 21-amino 
acid peptide (CSP) fSEQ ID NO:41 : 2) a histidine kinase fSEQ ID NO:61 that acts as a cell 
surface receptor activated by the peptide; 3) a response regulator fSEQ ID NO:81 that activates 
a number of other genes involved in genetic competence, biofilm formation, and acid tolerance 
..of S. mutans. These properties have- been attributed to -the bacterium's ability to cause dental - 
caries. Inactivation of any of these three genes or impairment of interaction or activity of any of 
their encoded proteins will disrupt the bacterium's ability to take up foreign DNA, form biofilms, 
and tolerate acidic pH. 

[0021] Streptococcus mutans is a resident of the biofilm environment of dental plaque, a 
matrix of bacteria and extracellular material that adheres to the tooth surface. Under 
appropriate environmental conditions populations of S. mutans and the pH of the surrounding 
plaque will drop. S. mutans, being among the most acid tolerant organisms residing in dental 
plaque, will increase it numbers in this acidic environment and eventually become a dominant 
member of the plaque community. This situation eventually leads to dissolution of the tooth 
enamel, resulting in the development of dental caries. We control the accumulation and acid 
tolerance of this bacterium to make it less able to cause caries. We accomplish this by using 
inhibitors of an extracellular signal peptide that promotes the expression of genes involved in S. 
mutans biofilm formation and acid tolerance. The invention includes compounds that inhibit the 
action of the peptide. These inhibitors can include peptides, antibodies, or other agents that 
specifically inhibit the activation of the histidine kinase and the family of genes activated as a 
result of the histidine kinase activation by the signal molecule. Inhibitors include: modified 
structures of the peptide where amino acids are removed from the N- and/or COOH terminal of 
the peptide and/or substitutions of internal amino acid residues. We delete, one, two to 5, 6 to 
10 and 10 to 15 amino acids from the peptide (for example at either terminal) and measure 
competitive inhibition of signal peptide binding to histidine 

[0022] kinase (1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more amino acids are deleted and inhibition 
measured). Inhibitors also include antibodies raised against the 21-amino acid CSP fSEQ ID 
NO:41 alone or coupled to a larger molecule to increase immunogenicity. 

[0024] In addition to identifying the genes encoding this signaling/sensing system, we have 
identified and chemically synthesized a 21-amino acid peptide fSEQ ID NO: 161 that promotes 
biofilm formation and acid tolerance of S. mutans. A survey of the literature and genome 
databases reveals that genes similar to this signal-receptor system are present in most Gram- 
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positive bacteria, and therefore an inhibitor, or family of related inhibitors may be effective at 
inhibiting biofilm formation among a large group of bacteria. 

[0025] The invention treats or prevents dental caries by addition of compounds that inhibit the 
stimulatory action of the 21 -amino acid peptide fSEQ ID NO:41 on biofilm formation and acid 
_ tolerance of S. mutans. This is accomplished by delivery of these -compounds to the biofilm ~ 
and/or to incorporate these inhibitors into materials to control growth on surfaces. This includes 
delivery by topical application, alone or in combination with other compounds including 
toothpaste, mouthwash, food or food additives. 

[0027] The invention includes an isolated CSP from S. mutans. The invention also includes a 
recombinant isolated CSP fSEQ ID NO:41 peptide produced by a cell including a nucleic acid 
molecule encoding CSP [SEQ ID NO:31 operably linked to a promoter. The invention also 
includes an isolated nucleic acid molecule encoding a CSP [SEQ ID NO:31 . The peptide we 
work with is preferably chemically synthesized rSEQ ID NO:16l . 

[0028] The invention includes CSP-encoding nucleic acid molecules rSEQ ID NO:3] and 
molecules having sequence identity or which hybridize to the CSP-encoding sequence and 
which encode a peptide having CSP activity (preferred percentages for sequence identity are 
described below) as well as vectors including these molecules. The invention also includes 
CSP [SEQ ID NO:41 or peptides having sequence identity (preferred percentages described 
below) or which have CSP activity. The nucleic acid molecules and peptides of the invention 
may be from S. mutans and they may be isolated from a native source, synthetic or 
recombinant. The invention includes CSP fSEQ ID NO:41 or peptides having sequence identity, 
which have CSP activity, as prepared by the processes described in this application. 

[0029] The invention includes an isolated HK [SEQ ID NO:61 from S. mutans. The invention 
also includes a recombinant isolated HK polypeptide produced by a cell including a nucleic acid 
molecule encoding HK [SEQ ID NO: 51 operably linked to a promoter. The invention also 
includes an isolated nucleic acid molecule encoding a HK polypeptide [SEQ ID NO:6] . 

[0030] The invention includes HK-encoding nucleic acid molecules and molecules having 
sequence identity or which hybridize to the HK-encoding sequence [SEQ ID NO:51 and which 
encode a protein having HK activity (preferred percentages for sequence identity are described 
below) as well as vectors including these molecules. The invention also includes HK [SEQ ID 
NO:41 or polypeptides having sequence identity (preferred percentages described below) or 
which have HK activity. The nucleic acid molecules and polypeptides of the invention may be 
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from S, mutans and they may be isolated from a native source, synthetic or recombinant. The 
invention includes HK [SEQ ID NO:41 or polypeptides having sequence identity, which have HK 
activity, as prepared by the processes described in this application. 

[0031] The invention includes an isolated RR fSEQ ID NO:61 from S. mutans. The invention 
also includes a recombinant isolated RR fSEQ ID NO:61 polypeptide produced by a cell 
- including a nucleic acid molecule encoding RR fSEQ ID NO:61 operably linked to a promoter. 
The invention also includes an isolated nucleic acid molecule encoding a RR polypeptide. 

[0032] The invention includes RR-encoding nucleic acid molecules and molecules having 
sequence identity or which hybridize to the RR-encoding sequence fSEQ ID NO:51 and which 
encode a polypeptide having RR activity (preferred percentages for sequence identity are 
described below) as well as vectors including these molecules. The invention also includes RR 
fSEQ ID NO:6T or polypeptides having sequence identity (preferred percentages described 
below) or which have RR activity. The nucleic acid molecules and polypeptides of the invention 
may be from S. mutans and they may be isolated from a native source, synthetic or 
recombinant. The invention includes RR [SEQ ID NO:61 or polypeptides having sequence 
identity, which have RR activity, as prepared by the processes described in this application. 

[0033] The comA and comB nucleotide fSEQ ID NO:27 and SEQ ID NO:291 and amino acid 
sequences fSEQ ID NO:28 and SEQ ID NO:301 are also aspects of the invention. ComA and 
ComB are components of the CSP exporter. The discussion of variants, sequence identity etc. 
for CSP, HK, RR applies to both the full sequences shown in the figures as well as bracketed 
portions of sequences (coding regions). The peptides and polypeptides may be natural, 
recombinantly produced or synthetic. 

[0034] The invention includes nucleic acid molecules that are functional equivalents of all or 
part of the CSP sequence in [SEQ ID NO:1] [SEQ ID NO:31 . (A nucleic acid molecule may also 
be referred to as a DNA sequence or nucleotide sequence in this application. All these terms 
have the same meaning as nucleic acid molecule). Functionally equivalent nucleic acid 
molecules are DNA and RNA (such as genomic DNA, complementary DNA, synthetic DNA, and 
messenger RNA molecules) that encode peptides having the same or similar CSP activity as 
the CSP peptide shown in [SEQ I D NO:2] [SEQ ID NO:41 . Functionally equivalent nucleic acid 
molecules can encode peptides that contain a region having sequence identity to a region of a 
CSP peptide [SEQ ID NO:41 or more preferably to the entire CSP peptide. Identity is calculated 
according to methods known in the art. The ClustalW program (preferably using default 
parameters) [Thompson, JD et al., Nucleic Acid Res. 22:4673-4680.], described below, is most 
preferred. For example, if a nucleic acid molecule (called "Sequence A") has 90% identity to a 
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portion of the nucleic acid molecule in [SEQ I D NO:1] fSEQ ID NO:31 , then Sequence A will 
preferably be identical to the referenced portion of the nucleic acid molecule in [SEQ I P NO:1] 
fSEQ ID NO:31 . except that Sequence A may include up to 10 point mutations, such as 
substitutions with other nucleotides, per each 100 nucleotides of the referenced portion of the 
nucleic acid molecule in [SEQ ID NO:1] fSEQ ID NO:31 . Mutations described in this application 
preferably do not disrupt the reading frame of the coding sequence; Nucleic acid molecules 
functionally equivalent to the CSP sequences can occur in a variety of forms as described 
below. 

[0035] Nucleic acid molecules may encode conservative amino acid changes in CSP peptide 
fSEQ ID NO:41 . The invention includes functionally equivalent nucleic acid molecules that encode 
conservative amino acid changes within a CSP amino acid sequence and produce silent amino 
acid changes in CSP. 

[0036] Nucleic acid molecules may encode non-conservative amino acid substitutions, 
additions or deletions in CSP peptide. The invention includes functionally equivalent nucleic acid 
molecules that make non-conservative amino acid changes within the CSP amino acid sequence 
in [SEQ I D NO:2] fSEQ ID NO:41 . Functionally equivalent nucleic acid molecules include DNA 
and RNA that encode peptides, peptides and proteins having non-conservative amino acid 
substitutions (preferably substitution of a chemically similar amino acid), additions, or deletions but 
which also retain the same or similar CSP activity as the CSP peptide shown in [SEQ ID NQ:2] 
fSEQ ID NO:41 . The DNA or RNA can encode fragments or variants of CSP. Fragments are 
useful as immunogens and in immunogenic compositions (U.S. Patent No. 5,837,472). The CSP 
or CSP-like activity of such fragments and variants is identified by assays as described below. 
Fragments and variants of CSP encompassed by the present invention should preferably have at 
least about 40%, 60%, 80% or 95% sequence identity to the naturally occurring CSP nucleic acid 
molecule, or a region of the sequence, such as the coding sequence or one of the conserved 
domains of the nucleic acid molecule, without being identical to the sequence in [SEQ I D NO:1] 
[SEQ ID NO:31 . Sequence identity is preferably measured with the ClustalW program (preferably 
using default parameters) (Thompson, JD et al., Nucleic Acid Res. 22:4673-4680) 

[0037] Nucleic acid molecules functionally equivalent to the CSP nucleic acid molecule in 
[SEQ ID NO:1] fSEQ ID NO:3] will be apparent from the following description. For example, the 
sequence shown in [SEQ I D NO;1] fSEQ ID NO:31 may have its length altered by natural or 
artificial mutations such as partial nucleotide insertion or deletion, so that when the entire length 
of the coding sequence within [SEQ I D NO:11 [SEQ ID NO:31 , is taken as 100%, the functional 
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equivalent nucleic acid molecule preferably has a length of about 60-120% thereof, more 
preferably about 80-1 10% thereof. Fragments may be less than 60%. 

[0038] Nucleic acid molecules containing partial (usually 80% or less, preferably 60% or less, 
more preferably 40% or less of the entire length) natural or artificial mutations so that some 
codons in these sequences code for different amino acids, but wherein the resulting peptide 
retains" the same or similar CSP activity as that of a naturally occurring CSP peptide fSEQ ID 
NO:41 . The mutated DNAs created in this manner should preferably encode a peptide having at 
least about 40%, preferably at least about 60%, at least about 80%, and more preferably at 
least about 90% or 95% sequence identity to the amino acid sequence of the CSP peptide in 
[SEQ I D NO:2] fSEQ ID NO:41 . The ClustalW program preferably assesses sequence identity. 

[0039] Since the genetic code is degenerate, the nucleic acid sequence in [SEQ I D NO:1] 
[SEQ ID NO:31 is not the only sequence which may code for a peptide having CSP activity. 
This invention includes nucleic acid molecules that have the same essential genetic information 
as the nucleic acid molecule described in [SEQ ID NO:1] [SEQ ID NO:31 . Nucleic acid 
molecules (including RNA) having one or more nucleic acid changes compared to the 
sequences described in this application and which result in production of a peptide shown in 
[SEQ I D NO:2] [SEQ ID NO:41 are within the scope of the invention. 

[0040] Other functional equivalent forms of CSP-encoding nucleic acids can be isolated 
using conventional DNA-DNA or DNA-RNA hybridization techniques. Thus, the present 
invention also includes nucleic acid molecules that hybridize to one or more of the sequences in 
[SEQ I D NO:1] [SEQ ID NO:31 or its complementary sequence, and that encode expression for 
peptides, peptides and proteins exhibiting the same or similar activity as that of the CSP peptide 
produced by the DNA in [SEQ I D NO:1] [SEQ ID NO:31 or its variants. Such nucleic acid 
molecules preferably hybridize to the sequence in [SEQ ID NO:1] fSEQ ID NO:31 under 
moderate to high stringency conditions (see Sambrook et al. Molecular Cloning: A Laboratory 
Manual, Most Recent Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). 
High stringency washes have low salt (preferably about 0.2% SSC), and low stringency washes 
have high salt (preferably about 2% SSC). A temperature of about 37 °C or about 42 °C is 
considered low stringency, and a temperature of about 50-65 °C is high stringency. The 
invention also includes a method of identifying nucleic acid molecules encoding a CSP activator 
peptide (preferably a mammalian peptide), including contacting a sample containing nucleic acid 
molecules including all or part of [SEQ I D NO:1] [SEQ ID NO:31 (preferably at least about 15 or 
20 nucleotides of fSEQ ID NO:11 [SEQ ID NO:31 ) under moderate or high stringency 
hybridization conditions and identifying nucleic acid molecules which hybridize to the nucleic 
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acid molecules including all or part of TSEQ I D NO:1l TSEQ ID NO:31 .l Similar methods are 
described in U.S. Patent No. 5,851,788, which is incorporated by reference in its entirety. 
[0041] The invention also includes methods of using all or part of the nucleic acid molecules 
which hybridize to all or part of [SEQ I D NO:1] fSEQ ID NO:31 . for example as probes or in 
assays to identify antagonists or inhibitors of the peptides produced by the nucleic acid 
-molecules (described below). -The invention also includes methods of using nucleic acid- - 
molecules having sequence identity to the CSP nucleic acid molecule fSEQ ID NO:31 (as 
described below) in similar methods. 

[0042] The invention also includes a nucleic acid molecule detection kit including, preferably 
in a suitable container means or attached to a surface, a nucleic acid molecule of the invention 
encoding CSP fSEQ ID NO:41 or a peptide having CSP activity and a detection reagent (such 
as a detectable label). Other variants of kits will be apparent from this description and teachings 
in patents such as U.S. Patent Nos. 5,837,472 and 5,801, 233,which are incorporated by 
reference in their entirety. 

[0043] A nucleic acid molecule described above is considered to have a function substantially 
equivalent to the CSP nucleic acid molecules fSEQ ID NO:31 of the present invention if the 
peptide rSEQ ID NO:41 produced by the nucleic acid molecule has CSP activity. A peptide 
has CSP activity if it can stimulate genetic competence and acid tolerance in S. mutans. 
Activation of the HK fSEQ ID NO:61 /RR fSEQ ID NO:81 is shown where a peptide is capable of 
stimulating the uptake and incorporation of foreign DNA. We describe below how the activity of 
these peptide-mediated processes can be measured by determining the efficiency of plasmid 
uptake, which is a measure of genetic competence. Since the ability to transport and 
incorporate foreign DNA relies on activation of the HK fSEQ ID NO:61/ RR fSEQ ID NO:81 and 
subsequent genes activated by the signal cascade initiated by the signal peptide, measurement 
of the conferment of erythromycin resistance by cells exposed to the peptide and plasmid DNA 
conferring erythromycin resistance indicates its level of function. Conversely if an inhibitor is 
capable of interfering with the action of the peptide the competence assay will indicate this by a 
corresponding decrease in the number of cells that acquire erythromycin resistance as 
described in the assays below (assays of genetic competence and assay of transformation of 
biofilms). Activation of the HK fSEQ ID NO:61 /RR fSEQ ID NO:81 is also shown where a 
peptide is capable of stimulating an acid tolerance response. We describe below how the 
activity of these peptide-mediated processes can be measured by determining the survival rate 
of cells in acidic pH conditions. Since the ability to survive exposure to acidic pH depends on 
the activation of the HK/RR and subsequent genes activated by the signal peptide, 
measurement of the survival of S. mutans in low pH conditions indicates the level of function of 
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the signal peptide. Conversely, if an inhibitor is capable of interfering with the signal peptide 
sensing system the assay for acid adaptation will indicate this by a corresponding decrease in 
the survival rate of cells grown in acidic pH conditions as described in the assay below (assay of 
acid adaptation). 

[0049] Another embodiment of the invention relates to a method of transfecting a cell with a 

nucleic acid molecule of the invention, inserted in an expression vector to produce a cell 
expressing the CSP peptide fSEQ ID NO:41 or other peptide of the invention. The invention 
also relates to a method of expressing the peptides of the invention in a cell. A preferred 
process would include culturing a cell including a recombinant DNA vector including a nucleic 
acid molecule encoding CSP [SEQ ID NO:31 (or another nucleic acid molecule of the invention) 
in a culture medium so that the peptide is expressed. The process preferably further includes 
recovering the peptide from the cells or culture medium. 

[0050] The invention also includes oligonucleotide probes made from the cloned CSP nucleic 
acid molecules described in this application or other nucleic acid molecules of the invention (see 
Materials and Methods section). The probes may be 15 to 20 nucleotides in length. A preferred 
probe is at least 15 nucleotides of CSP in [SEQ ID NO:1] fSEQ ID NO:31 . The invention also 
includes at least 15 consecutive nucleotides of fSEQ I D NO: 11 fSEQ ID NO:31 . The probes are 
useful to identify nucleic acids encoding CSP peptides as well as peptides functionally 
equivalent to CSP. The oligonucleotide probes are capable of hybridizing to the sequence 
shown in [SEQ I D NO:1] fSEQ ID NO:31 under stringent hybridization conditions. A nucleic acid 
molecule encoding a peptide of the invention may be isolated from other organisms by 
screening a library under moderate to high stringency hybridization conditions with a labeled 
probe. The activity of the peptide encoded by the nucleic acid molecule is assessed by cloning 
and expression of the DNA. After the expression product is isolated, the peptide is assayed for 
CSP activity as described in this application. 

[0051] Functionally equivalent CSP nucleic acid molecules from other cells, or equivalent 
CSP-encoding cDNAs or synthetic DNAs, can also be isolated by amplification using 
Polymerase Chain Reaction (PCR) methods. Oligonucleotide primers, such as degenerate 
primers, based on [SEQ I D NO :1 ] fSEQ ID NO:31 can be prepared and used with PCR and 
reverse transcriptase (E. S. Kawasaki (1990), In Innis et al., Eds., PCR Protocols, Academic 
Press, San Diego, Chapter 3, p. 21) to amplify functional equivalent DNAs from genomic or 
cDNA libraries of other organisms. The oligonucleotides can also be used as probes to screen 
cDNA libraries. 
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[0052] The present invention includes not only the peptides encoded by the sequences of the 
invention, but also functionally equivalent peptides, peptides and proteins that exhibit the same 
or similar CSP peptide activity. A peptide is considered to possess a function substantially 
equivalent to that of the CSP peptide fSEQ ID NO:41 if it has CSP activity. CSP activity means 
that it is able to confer genetic competence to S. mutans, as measured by an increased ability to 
incorporate and express foreign genetic material; when" added to cells as described in the assay 
of genetic competence below. CSP activity also means that the peptide is able to confer an 
acid tolerance response in S. mutans as measured by an increase in cell survival under acidic 
pH conditions when added to cells as described in the assay for acid adaptation below. 
Functionally equivalent peptides, peptides and proteins include peptides, peptides and proteins 
that have the same or similar protein activity as CSP when assayed, i.e. they are able to 
stimulate genetic competence and low pH tolerance (the ability to withstand acid challenges of 
pH 3.5 -pH 3.0 for up to 3 hours) in S. mutans. A peptide has CSP activity if it is capable of 
increasing the frequency of uptake and expression of foreign DNA as described in the following 
assay for genetic competence and if the peptide can promote an acid tolerance response as 
described in the assay for acid adaptation. 

[0053] Identity refers to the similarity of two peptides or proteins that are aligned so that the 
highest order match is obtained. Identity is calculated according to methods known in the art, 
such as the ClustalW program. For example, if a peptide (called "Sequence A") has 90% 
identity to a portion of the peptide in [SEQ I D NO:2] fSEQ ID NO:161 . then Sequence A will be 
identical to the referenced portion of the peptide in [SEQ ID NO:2] [SEQ ID NO:161 , except that 
Sequence A may include up to 1 point mutations, such as substitutions with other amino acids, 
per each 10 amino acids of the referenced portion of the peptide in [SEQ ID NQ:2] fSEQ ID 
NO:161 . Peptides, peptides and proteins functional equivalent to the CSP peptides can occur in 
a variety of forms as described below. 

[0054] Peptides biologically equivalent in function to CSP peptide include amino acid 
sequences containing amino acid changes in the CSP sequence fSEQ ID NO:41 . The 
functional equivalent peptides have at least about 40% sequence identity, preferably at least 
about 60%, at least about 75%, at least about 80%, at least about 90% or at least about 95% 
sequence identity, to the natural CSP peptide rSEQ ID NO:41 or a corresponding region. The 
ClustalW program preferably determines sequence identity. Most preferably, 1, 2, 3, 4, 5, 5-10, 
10-15 amino acids are modified. 
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[0058] The variants preferably retain the same or similar CSP activity as the naturally 
occurring CSP fSEQ ID NO:41 . The CSP activity of such variants can be assayed by 
techniques described in this application and known in the art. 

[0059] Variants produced by combinations of the techniques described above but which 
retain the same or similar CSP activity as naturally occurring CSP rSEQ ID NO:41 are also 
included in the invention (for example, combinations of amino acid additions, and substitutions). 

[0062] As well, probes and antibodies for a histidine kinase fSEQ ID NO:5 and SEQ ID 
NO:6] , response regulator fSEQ ID NO:7 and SEQ ID NO:81 comA fSEQ ID NO:27 and SEQ 
ID NO:281 or comB fSEQ ID NO:29 and SEQ ID NO:301 may be prepared using the description 
in this application and techniques known in the art. The description for preparation of CSP 
variants and mutants is also applicable to the histidine kinase fSEQ ID NO:5 and SEQ ID 
NO:61 , response regulator fSEQ ID NO:7 and SEQ ID NO:81 or comA FSEQ ID NO:27 and 
SEQ ID NO:281 and comB fSEQ ID NO:29 and SEQ ID NO:301 of the invention. The invention 
also includes fragments of HK fSEQ ID NO:5 and SEQ ID NO:61 having HK activity, fragments 
of RR fSEQ ID NO:7 and SEQ ID NO:81 having RR activity and fragments of comA fSEQ ID 
NO:27 and SEQ ID NO:281 or comB fSEQ ID NO.29 and SEQ ID NO:301 having activity. 

[0063] The activity of the CSP peptide [SEQ ID NO:41 may be varied by carrying out 
selective site-directed mutagenesis. We characterize the binding domain and other critical 
amino acid residues in the peptide that are candidates for mutation, insertion and/or deletion. 
Sequence variants may be synthesized. A DNA plasmid or expression vector containing the 
CSP nucleic acid molecule [SEQ ID NO:31 or a nucleic acid molecule having sequence identity 
may be used for these studies using the U.S.E. (Unique site elimination) mutagenesis kit from 
Pharmacia Biotech or other mutagenesis kits that are commercially available, or using PCR. 
Once the mutation is created and confirmed by DNA sequence analysis, the mutant peptide is 
expressed using an expression system and its activity is monitored. This approach is useful to 
identify CSP inhibitors. All these modifications of the CSP DNA sequences fSEQ ID NO:31 
presented in this application and the peptides produced by the modified sequences are 
encompassed by the present invention. 

[0064] The CSP inhibitors are also useful when combined with a carrier in a pharmaceutical 
composition. The compositions are useful when administered in methods of medical treatment 
or prophylaxis of a disease, disorder or abnormal physical state caused by S. mutans. The 
invention also includes methods of medical treatment of a disease, disorder or abnormal 
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physical state characterized by excessive S. mutans or levels or activity of CSP peptide fSEQ 
ID NO:41 , for example by administering a pharmaceutical composition including a carrier and a 
CSP inhibitor. Caries is one example of a disease, which can be treated or prevented by 
antagonizing CSP fSEQ ID NO:41 . 

[0066]' CSP activity could" be blocked by antisense mRNA or by inhibiting the "activity "of the 
exporter that secretes it from the cell. We have the sequence of these exporters. There are two 
copies of the genes (comAB) fSEQ ID NO:27 and SEQ ID NO:291 that are involved in export. 

[0067] Nucleic acid molecules (antisense inhibitors of CSP) [SEQ ID NO:31 and competitive 
inhibitors of CSP fSEQ ID NO:41 may be introduced into cells using in vivo delivery vehicles such 
as liposomes. They may also be introduced into these cells using physical techniques such as 
microinjection and electroporation or chemical methods such as coprecipitation or using liposomes. 

[0070] Antibodies directed against the CSP [SEQ ID NO:4 or SEQ ID NO:161 would provide 
protection against caries. Antibodies may be manufactured as described below. Alternatively, a 
peptide of the invention [SEQ ID NO:4 or SEQ ID NO: 161 or a fragment thereof may be used 
with a carrier to make a vaccine. The peptide or fragment may also be conjugated to another 
molecule to increase its antigenicity. Antibodies can also be coupled to the peptide (Brady, L.J. 
et al., "Monoclonal Antibody-Mediated Modulation of the Humoral Immune Response against 
Mucosally Applied Streptococcus mutans" (in press). In order to enhance the immune response 
the peptide can be coupled to KLH, ovalbumin, or thyroglobulin prior to immunization. The 
vaccine composition will trigger the mammal's immune system to produce antibodies. The 
invention includes vaccine compositions and methods of vaccinating a mammal, preferably a 
human, against dental caries by administering to the mammal an effective amount of a vaccine 
composition. Techniques for preparing and using vaccines are known in the art. To prepare the 
vaccine, the peptide, or a fragment of the peptide, may be mixed with other antigens (of different 
immunogenicity), a vehicle or an excipient. Examples of peptide vaccines are found in U.S. 
Patent Nos. 5,679,352, 5,194,254 and 4,950,480. Techniques for preparing vaccines involving 
site-directed mutagenesis are described in U.S. Patent Nos. 5,714,372, 5,543,302, 5,433,945, 
5,358,868, 5,332,583, 5,244,657, 5,221,618, 5,147,643, 5,085,862 and 5,073,494. Vaccines 
may be administered by known techniques, such as topical or parenteral administration. Vast 
changes are taking place in vaccinology consequent to the introduction of new technologies. 
Acellular purified fractions devoid of side effects, non-pathogenic but immunogenic mutants, 
recombinant technology, conjugated vaccines, combination vaccines (to limit the number of 
injections). Vaccine delivery systems can deliver multiple doses of the vaccine at a single 
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contact point. A genetically engineered oral vaccine is useful to impart better and longer 
duration of immunity. Oral vaccines are useful. The nose as a route for immunization is also 
useful. DNA alone can constitute the vaccines, inducing both humoral and cell-mediated 
immune responses. Live recombinant vaccines are also useful. Potent adjuvants add to the 
efficacy of the vaccines. One can also 'humanize 1 mouse monoclonals by genetic engineering 
and express these efficiently in plants. These recombinant antibodies are opening out an era" of 
highly specific and safe therapeutic interventions. An advantage of preformed antibodies 
directed at a defined target and given in adequate amounts is the certainty of efficacy in every 
recipient, in contrast to vaccines, where the quality and quantum of immune response varies 
from individual to individual. For example, nasal immunization may be done as described in C. 
Jespersgaard et al. "Protective Immunity against Streptococcus mutans Infection in Mice after 
Intranasal Immunization with the Glucan-Binding Region of S. mutans Glucosyltransferase" 
Infection and Immunity, December 1999, p. 6543-6549, Vol. 67, No. 12. Vaccine compositions 
may comprise solid or liquid formulations such as gels, sprays, inhalants, tablets, toothpastes, 
mouthwashes or chewing gum. 

[0072] Inhibitors are preferably directed towards CSP fSEQ ID NO:4 or SEQ ID NO: 161 to 
block S. mutans competence, low pH tolerance and biofilm formation. 

[0073] A method of identifying a compound which reduces the interaction of CSP fSEQ ID 
NO:4 or SEQIPNO:16l with HK [SEQ ID NO:61 . can include: contacting (i) CSP [SEQ ID 
NO:4 or SEQ ID NO:161 with (ii) HK [SEQ ID NO:61 . a CSP-binding fragment of HK fSEQ ID 
NO:61 or a derivative of either of the foregoing in the presence of the compound; and b) 

determining whether the interaction between (i) and (ii) is reduced, thereby indicating that 
the compound reduces the interaction of CSP fSEQ ID NO:4 or SEQ ID NO:16l and HK [SEQ 
ID NO:61 . A CSP inhibitor (caries treating or preventing compound) inhibits the interaction 
between (i) and (ii). By way of example, one can screen a synthetic peptide library. One could 
also screen small non-peptide organic molecules. 

[0074] In one embodiment, the invention includes an assay for evaluating whether test 
compounds are capable of acting as agonists or antagonists for CSP fSEQ ID NO:4T . or a 
peptide having CSP functional activity, including culturing cells containing DNA which expresses 
CSP [SEQ ID NO:31 . or a peptide having CSP activity so that the culturing is carried out in the 
presence of at least one compound whose ability to modulate CSP activity is sought to be 
determined and thereafter monitoring the cells for either an increase or decrease in the level of 
CSP [SEQ ID NO:4 or SEQ ID NO:161 or CSP activity. Other assays (as well as variations of 
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the above assay) will be apparent from the description of this invention and techniques such as 
those disclosed in U.S. Patent No. 5,851,788, 5,736,337 and 5,767,075 which are incorporated 
by reference in their entirety. For example, the test compound levels may be either fixed or 
variable. 

[0075] The CSP [SEQ ID NO:4 or SEQ ID NO:16l peptide is also useful as an antigen for the 

preparation of antibodies that can be used to purify or detect other CSP-like peptides. Antibodies 

may also block CSP fSEQ ID NO:41 binding to HK fSEQ ID NO:61 . Antibodies are preferably 
targeted to the entire CSP fSEQ ID NO:41 sequence. The CSP peptide [SEQ ID NO:4 or SEQ 
ID NO: 161 may be conjugated to other compounds, in order to increase immunogenicity. 

[0076] We generate polyclonal antibodies against the CSP [SEQ ID NO:4 or SEQ ID 
NO:161 , which is a unique sequence. Monoclonal and polyclonal antibodies are prepared 
according to the description in this application and techniques known in the art. For examples 
of methods of preparation and uses of monoclonal antibodies, see U.S. Patent Nos. 5,688,681, 
5,688,657, 5,683,693, 5,667,781, 5,665,356, 5,591,628, 5,510,241, 5,503,987, 5,501,988, 
5,500,345 and 5,496,705, which are incorporated by reference in their entirety. Examples of the 
preparation and uses of polyclonal antibodies are disclosed in U.S. Patent Nos. 5,512,282, 
4,828,985, 5,225,331 and 5,124,147 which are incorporated by reference in their entirety. 
Antibodies recognizing CSP [SEQ ID NO:4 or SEQ ID NO:161 can be employed to screen 
organisms or tissues containing CSP peptide fSEQ ID NO:41 or CSP-like peptides. The 
antibodies are also valuable for immuno-purification of CSP [SEQ ID NO:41 or CSP-like 
peptides from crude extracts. 

[0077] An antibody (preferably the antibody described above) may be used to detect CSP 
fSEQ ID NO:41 or a similar peptide, for example, by contacting a biological sample with the 
antibody under conditions allowing the formation of an immunological complex between the 
antibody and a peptide recognized by the antibody and detecting the presence or absence of 
the immunological complex whereby the presence of CSP [SEQ ID NO:41 or a similar peptide is 
detected in the sample. The invention also includes compositions preferably including the 
antibody, a medium suitable for the formation of an immunological complex between the 
antibody and a peptide recognized by the antibody and a reagent capable of detecting the 
immunolgical complex to ascertain the presence of CSP fSEQ ID NO:41 or a similar peptide. 
The invention also includes a kit for the in vitro detection of the presence or absence of CSP 
[SEQ ID NO:41 or a similar peptide in a biological sample, wherein the kit preferably includes an 
antibody, a medium suitable for the formation of an immunological complex between the 
antibody and a peptide recognized by the antibody and a reagent capable of detecting the 
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immunological complex to ascertain the presence of CSP rSEQ ID NO:41 or a similar peptide in 
a biological sample. Further background on the use of antibodies is provided, for example in 
U.S. Patent Nos. 5,695,931 and 5,837,472, which are incorporated by reference in their entirety. 
[0078] The ability of the peptide to activate the HK fSEQ ID NO: 61 and RR fSEQ ID NO:81 
and the subsequent genes involved in the conferral of the properties of genetic competence, 
- acid tolerance and biofilm formation can be determined by measuring the efficiency of uptake 
and expression of DNA (preferably plasmid DNA) in S. mutans when exposed to signal peptide 
and/or inhibitor. Two methods modified based on the protocols described by Perry et al. Infect 
Immun, 41:722-727 and Lindler and Macrina J Bacteriol, 166:658-665 are used to assay genetic 
competence. The method involves adding DNA and CSP fSEQ ID NO:31 (preferably plasmid 
DNA) to a S. mutans culture (or culture of a bacteria expressing CSP fSEQ ID NO:31 or a 
variant thereof). The rate of transformation is then determined. S. mutans is preferably grown in 
THYE plus 5% horse serum (THYE-HS). After 2-hr incubation, 1 jag/ml plasmid DNA or 10 jug/ml 
of chromosomal DNA is added to the culture. To assay induction of competence, competence 
signal peptide, (SCSP) fSEQ ID NO: 161 is then added to the cultures, incubation continued for 
30 minutes with a final concentration of 500 ng/ml of SCSP added to each sample. After the 30- 
minute incubation equal amounts of DNA is added to each well (1 ng/ml plasmid or 10 ^g/ml of 
chromosomal DNA) and incubation continued for another 2 hrs.. Cell dilutions were immediately 
spread on THYE agar plates plus appropriate antibiotics. Transformation frequency was 
expressed as the number of transformants (antibiotic resistant cells) per number of viable 
recipients. This is determined by comparing the number of cells able to grow in the presence of 
antibiotic (conferred by the applied plasmid or chromosomal DNA) relative to the total number of 
cells present (i.e., that grow in the absence of antibiotic). A higher value indicates a higher rate 
of transformation and thus is reflective of a stimulatory effect by the peptide. Consequently, 
addition of a molecule that successfully acts as an inhibitor results in a lower ratio of 
transformants/recipients, indicating that the inhibitor is effective at blocking activity of the CSP 
fSEQ ID NO:41 . CSP deficient cells fSEQ ID NO:3 or SEQ ID NO: 41 may also be used in a 
variation of these assays. One can identify compounds that inhibit CSP rSEQ ID NO:41 or 
variants thereof by adding a test compound to the mixture to determine if the rate of 
transformation is decreased by the addition of the test compound. 

[0079] The activity of the system can also be measured by an in vitro assay that relies on the 
measurement of marker protein expression (such as green fluorescent protein (GFP)) via 
expression from a fusion to a promoter controlled by the signal cascade initiated by CSP fSEQ 
ID NO:4V HK fSEQ ID NO:61 /RR fSEQ ID NO:81 . One such promoter occurs immediately 5' 
proximal to the S. mutans comX gene. S. mutans cells grown in microtiter wells are exposed to 
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the CSP rSEQ ID N0:41 and/or inhibitor and the level of fluorescence of the comX::GFP strain 
is measured to give a quantitative measure of CSP fSEQ ID NO:41 stimulation (and conversely 
inhibitor activity). One can identify compounds that inhibit CSP fSEQ ID NO:41 or variants 
thereof by adding a test compound to the mixture to determine if the quantitative measure of 
CSP fSEQ ID NO:41 stimulation is decreased by the addition of the test compound. 
[0080] The ability of CSP fSEQ ID NO:41 to promote acid resistance tolerance is determined 
by measuring the cell survival rate of S. mutans when exposed to acidic pH. In one example, S. 
mutans are first grown in batch culture to assay acid tolerance response in 'standard' log- and 
stationary-phase cells by using a modification of methods described previously by Svensater et 
al. Oral Microbiol. Immunol., 12:266-73. Mid-log-phase cells are obtained by transferring one 
volume of overnight culture into nine volumes (1:10) of fresh TYG medium (pH 7.5) and 
incubated at 37°C with 5% C0 2 for 2 hours. These cells are then collected by centrifugation at 
8,000 x g for 10 min and resuspended in 2 ml of fresh TYG (pH 5.5) at various cell densities as 
determined by O.D 600 - The cells are induced for acid adaptation by incubation at pH 5.5 for 2 h 
at 37°C with 5% C0 2 . The adapted log-phase cells are then exposed to the killing pH. Killing 
pH is pre-determined by incubating unadapted, mid-log phase cells in TYG medium at pH 
values from 6.0 to 2.0. Stationary-phase cells are prepared by re-suspending late-log phase 
cells in TY medium (tryptone-yeast extract) without glucose. The culture is incubated at 37°C 
for 2 h to allow the cells to fully enter into stationary phase. Induction of acid adaptation in 
stationary-phase cells follows a similar procedure to that for log-phase cells. Adaptation of both 
log- and stationary-phase cells to acidic pH is determined by measuring the ability of bacterial 
cells to survive a killing pH for 3 h. Acid killing is initiated by resuspending cells in the same 
volume of fresh TYG (pH 3.5) and an aliquot of cell suspension is taken immediately from each 
sample to determine total viable cell number at zero time. The cells are then incubated for 3 h 
at 37°C with 5% C0 2 and an aliquot of sample is taken to determine survival rate by viable cell 
counts. Addition of a molecule that successfully acts as an inhibitor results in a decrease in the 
acid resistance tolerance of S mutans resulting in a corresponding decrease in cell survival 
indicating that the inhibitor is effective at blocking activity of CSP fSEQ ID NO:41. CSP [SEQ 
ID NO:3 or SEQ ID NQ:41 deficient cells may also be used in a variation of these assays 
wherein addition of the signal peptide can complement the acid-adaptation-defective phenotype 
of a comC fSEQ ID NO:1 or SEQ ID NO:21 deficient cell. One can identify compounds that 
inhibit CSP fSEQ ID NO:3 or SEQ ID NO:41 or variants thereof by adding a test compound to 
the mixture to determine if the survival rate of cells is decreased by the addition of the test 
compound 
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[0081] Cells transformed with a nucleic acid molecule of the invention (histidine kinase fSEQ 
ID NO:51 , CSP fSEQ ID NO:31 or response regulator fSEQ ID NO:71 ) are useful as research 
tools. For example, one may obtain a cell (or a cell line, such as an immortalized cell culture or 
a primary cell culture) that does not express histidine kinase [SEQ ID NO:51 , CSP fSEQ ID 
NO:31 or response regulator [SEQ ID NO:8l insert a histidine kinase fSEQ ID NO:51 , CSP fSEQ 
ID NO:31 or response regulator fSEQ ID NO:71 nucleic acid" molecule in the cell, and assess the 
level of expression and activity. Alternatively, histidine kinase FSEQ ID NO:51 , CSP fSEQ ID 
NO:31 or response regulator fSEQ ID NO:71 nucleic acid molecules may be over-expressed in a 
cell that expresses a histidine kinase fSEQ ID NO:51 , CSP fSEQ ID NO: 31 or response 
regulator fSEQ ID NO:71 nucleic acid molecule. In another example, experimental groups of 
cells may be transformed with vectors containing different types of histidine kinase fSEQ ID 
NO:51 , CSP fSEQ ID NO:31 or response regulator fSEQ ID NO:7] nucleic acid molecules to 
assess the levels of polypeptides and peptides produced, its functionality and the phenotype of 
the cells. The polypeptides and peptides are also useful for in vitro analysis of histidine fSEQ 
ID NO:61 kinase, CSP fSEQ ID NO:41 or response regulator fSEQ ID NO:81 activity or 
structure. For example, the polypeptides and peptides produced can be used for microscopy or 
X-ray crystallography studies. 

[0082] The histidine kinase fSEQ ID NO:5 and SEQ ID NO:61 . CSP FSEQ ID NO:3 and SEQ 
ID NO:41 or response regulator fSEQ ID NO:7 and SEQ ID NO:8] nucleic acid molecules and 
polypeptides are also useful in assays for the identification and development of compounds to 
inhibit and/or enhance polypeptide or peptide function directly. For example, they are useful in 
an assay for evaluating whether test compounds are capable of acting as antagonists for 
histidine kinase fSEQ ID NO: 61 CSP fSEQ ID NO:41 or response regulator fSEQ ID NO:81 by: 
(a) culturing cells containing a nucleic acid molecule which expresses histidine kinase fSEQ ID 
NO:51 , CSP [SEQ ID NO:31 or response regulator peptides fSEQ ID NO:71 (or fragments or 
variants thereof having histidine fSEQ ID NQ:6] kinase, CSP or response regulator activity) 
wherein the culturing is carried out in the presence of increasing concentrations of at least one 
test compound whose ability to inhibit histidine FSEQ ID NO:61 kinase, CSP fSEQ ID NO:41 or 
response regulator FSEQ ID NO:81 is sought to be determined; and (b) monitoring in the cells 
the level of inhibition as a function of the concentration of the test compound, thereby indicating 
the ability of the test compound to inhibit histidine kinase fSEQ ID NO:61 . CSP fSEQ ID NO:4f 
or response regulator fSEQ ID NO:81 activity. 

[0085] Biofilms are developed on polystyrene microtiter plates to provide a rapid and simple 
method for assaying biofilm formation, and hence activity of the peptide fSEQ ID NO:41 /receptor 
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rSEQ ID N0:81 /kinase fSEQ ID NO:61 system. Formation of biofilms is initiated by inoculating 
20 ul of cell suspension into each well containing 2 ml of biofilm medium (4X diluted Todd- 
Hewitt Yeast Extract supplemented with final concentration of 0.01% hog gastric mucin) for 
overnight incubation at 37°C under an anaerobic condition. After 20-h incubation, fluid medium 
is removed and added with 2 ml of pre-warmed, fresh THYE plus 5% horse serum. The cultures 
are incubated for 30 minutes and each welMs supplemented with a frnal concenfration of 200" 
ng/ml of synthetic competence stimulating peptide (SCSP) and varying concentrations of the 
inhibitor and the incubation is continued. After 30 minutes, plasmid DNA (1 mg/ml) or 
chromosomal DNA (10 mg/ml) is added to each well and the cultures are incubated for an 
additional 2 hr. Planktonic cells are then removed and the wells are washed once with PBS 
buffer. Biofilm cells are collected into 2 ml fresh medium by a gentle sonication or washing the 
wells using a pipette. The samples are centrifuged at 12,000 x g for 5 min. Both biofilm and 
planktonic cells are resuspended into 200 jj\ of fresh medium and are immediately spread on 
THYE agar plus appropriate antibiotics. Transformation frequency is determined after 48-h of 
incubation. 

[0086] Homologues of the Streptococcus pneumoniae comD fSEQ ID NO:51/ E fSEQ ID 
NO:71 genes encoding a histidine kinase FSEQ ID NO:61 / response regulator fSEQ ID NO:81 
system were identified. This sequence was used to design primers to amplify the region from a 
number of S. mutans isolates. An open reading frame consisting of 138 nucleotides was 
located 148 nucleotides 5' proximal from the end of the comD homolog in the opposite 
orientation (Fig 1). This ORF was found to encode a peptide of 46-amino acid FSEQ ID NO:21 
in length, the precursor of the 21 -amino acid CSP fSEQ ID NO:41 . 

[0087] The comCDE genes fSEQ ID NO:231 were amplified from the genomes of several S. 
mutans isolates by PCR using primers designed based on the genome database sequence and 
their nucleotide sequences determined. The deduced amino acid sequences are compared 
among the isolates by sequence alignment to confirm identity. 

[0088] Genes are inactivated by integration of internal homologous fragments into the suicide 
vector pVA8912. Mutants defective in each of the individual genes (comC fSEQ ID NO:11. 
comD rSEQ ID NO:51, comE fSEQ ID NO:71 ) are inactivated and their phenotypes are 
compared to the parent strain NG8 for their abilities to form biofilms, tolerate acidic pH (pH 2-4), 
and transport and incorporate DNA. The knockout mutants of com D [SEQ ID NO: 51 and E 
rSEQ ID NO:71 were constructed by insertion-duplication mutagenesis, whereas the knockout 
comC fSEQ ID NO:1l mutant was created by allelic exchange via insertion of an erythromycin 
resistance determinant into the comC fSEQ ID NO:11 locus (Li et al, 2001). All mutant strains 
were therefore resistant to erythromycin. The wild-type strain was subcultured routinely on 
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Todd-Hewitt- Yeast Extract (THYE) agar plates (BBL , Becton Dickinson, Cockeysville, MD), 
whereas the mutants were maintained on THYE agar plus 10 ^ig/ml of erythromycin. A minimal 
medium (DMM) was prepared to grow biofilms by a modification of the method described 
previously (Loo et a/, 2000). The medium contained 58 mM K 2 HP0 4l 15 mM KH 2 P0 4 , 10 mM 
(NH 4 ) 2 S0 4 , 35 mM NaCI, 2 mM MgS0 2 -7H 2 0, 0.2% (wt/vol) Casamino Acids and was 
supplemented with filter-sterilized vitamins, (0.04 mM nicotinic acid, 0.1 mM pyridoxine HCI, 
0.01 mM pantothenic acid, 1 pM riboflavin, 0.3 pM thiamin HCI, and 0.05 pM D-biotin), amino 
acids (4 mM L-glutamic acid, 1 mM L-arginine HCI, 1.3 mM L-cysteine HCI, and 0.1 mM L- 
tryptophan) and 20 mM glucose. 

[0090] To determine if the synthetic peptide fSEQ ID NO:161 could restore defective 
phenotypes of the comC \SEQ ID NO:21 mutants, a chemically synthesized 21 -amino acid 
competence-stimulating peptide (CSP) fSEQ ID NO:41 (Li et al, 2001) was used in 
complementary experiments. The peptide was freshly dissolved in sterile distilled water to a 
concentration of 1 mg/ml. The CSP solution was then added to the cultures at a final 
concentration of 2 pg/ml 2 h after inoculation of bacterial cells. 

[0096] Two methods modified based on the protocols described by Perry ef al (Infect Immun, 
41:722-727) and Lindler and Macrina (J Bacteriol, 166:658-665) were used to assay natural 
transformation of biofilm cells. Biofilms formed on polystyrene microtiter plates were added with 
2 ml of pre-warmed, fresh THYE plus 5% horse serum (THYE-HS) immediately following 
removal of the BM medium, and the incubation continued at 37°C. After 2h incubation, a final 
concentration of 1 ng/ml plasmid DNA or 10 ng/ml of chromosomal DNA was added to each 
well. The cultures were incubated for an additional 2 h before collection of the cells for plating. 
To assay induction of competence by synthetic competence stimulating peptide (SCSP) [SEQ 
ID NO:16l . the cultures were incubated for 30 min and a final concentration of 500 ng/ml of 
SCSP ) [SEQ ID NO:161 was added to each well. After a 30 min incubation, equal amounts of 
DNA was added to each well (1 jig/ml plasmid or 10 jig/ml of chromosomal DNA) and 
incubation continued for another 2 h. Fluid medium was then removed from individual wells and 
the wells were washed once with PBS buffer. Biofilm cells were collected into 2 ml PBS buffer 
by gentle sonication or by washing the wells using a pipette. The samples were centrifuged at 
12,000 X g for 5 min. Both biofilm and planktonic cells were resuspended into 200 nl of fresh 
medium and were immediately spread on THYE agar plates plus appropriate antibiotics. For the 
biofilms developed in the chemostat, rods with biofilm cells were removed and placed into 2 ml 
of pre-warmed, fresh THYE-HS medium for 30 min incubation. Transformation was then 
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initiated by using the same methods as described above. The planktonic cells were also 
removed to compare the transformation frequency. After completion of the transformation 
procedures, both biofilm and planktonic cells were spread on THYE agar plus appropriate 
antibiotic. Transformation frequency was assessed after 48-h incubation. Transformation 
frequency was expressed as the number of transformants per \xg DNA per viable recipient at the 
time-of DNA added. - — " 

In the Claims: 

3. (Amended) The compound of claim 2, comprising a derivative of [SEQ ID NO:2] or fSEQ 
ID NO:41 . a fragment of [SEQ ID NO:2] or fSEQ ID NO:41 or a derivative of a fragment of [SEQ ID 
NO:2] or FSEQ ID NO:41 . 

4. (Amended) The compound of claim 3, wherein amino acids are removed from the N- 
terminus and/or C-terminus of [SEQ ID NO:2] or fSEQ ID NO:41 . 

5. (Amended) A pharmaceutical composition comprising all or part of the peptide of claims 1 
te-4 -claim 1 and a carrier. 

6. (Amended) A method of medical treatment or prophylaxis of caries or endocarditis, 
comprising administering the compound of claim l a ny o f c l a i ms 1 to 4 or th e pharm a c e ut i ca l 
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24. (Amended) The polypeptide of claim 23 comprising all or part of an amino acid 
sequence in [SEQ ID NO:2] or fSEQ ID NO:41 . 

29. (Amended) An isolated nucleic acid molecule encoding the polypeptide of any of claims 
24 22 to 28. 

30. (Amended) An antibody directed against the polypeptide of any of claims 24 22 to 28. 

- 32. - (Amended) A vaccine composition comprising all or part of "the peptide of any of claims 
24 22 to 28 and a carrier. 

34. CANCELLED 

35. CANCELLED 

36. CANCELLED 

37. CANCELLED 
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COMPLETE SET OF PENDING CLAIMS, AS AMENDED ON SEPTEMBER 17, 2001 

1 . A compound that competitively inhibits binding of CSP to S. mutans histidine kinase. 

2. The compound of claim 1 , comprising a peptide or an antibody. 

3. (Amended) The compound of claim 2, comprising a derivative of [SEQ ID NO:2] or [SEQ 
ID NO:4], a fragment of [SEQ ID NO:2] or [SEQ ID NO:4] or a derivative of a fragment of [SEQ ID 
NO:2] or [SEQ ID NO:4]. 

4. (Amended) The compound of claim 3, wherein amino acids are removed from the N- 
terminus and/or C-terminus of [SEQ ID NO:2] or [SEQ ID NO:4]. 

5. (Amended) A pharmaceutical composition comprising all or part of the peptide of claim 1 
and a carrier. 

6. (Amended) A method of medical treatment or prophylaxis of caries or endocarditis, 
comprising administering the compound of claim 1 . 

22. An isolated CSP or a fragment thereof having S. mutans CSP activity. 

23. The polypeptide of claim 22 comprising a S. mutans CSP. 

24. (Amended) The polypeptide of claim 23 comprising all or part of an amino acid 
sequence in [SEQ ID NO:2] or [SEQ ID NO:4]. 

25. A polypeptide fragment of the peptide of claim 24, or a peptide mimetic of the CSP. 

26. The polypeptide of claim 24 which is recombinantly produced. 

27. A polypeptide comprising a sequence having greater than 30%, 50% or 60% sequence 
identity to the polypeptide of claim 24. 

28. The polypeptide of claim 24, isolated from S. mutans. 

29. (Amended) An isolated nucleic acid molecule encoding the polypeptide of any of claims 
22 to 28. 

30. (Amended) An antibody directed against the polypeptide of any of claims 22 to 28. 

31 . The antibody of claim 30, comprising a monoclonal antibody or a polyclonal antibody. 

32. (Amended) A vaccine composition comprising all or part of the peptide of any of claims 
22 to 28 and a carrier. 

33. The vaccine composition of claim 32, wherein the peptide is coupled to a compound 
comprising -all or part of KLH, ovalbumin, or thyroglobulin. 
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Figure 1 

^^^y^ ^g e ment o f th ematic |nr m enrnri infi tfa o slaml ucd U ^ 

-gene-i a U amuribed from ih mvn pw i n o fel ul u ne, uul&e Uie ^^^L^^T 

tains tra a aaibed fl oa^i ngfe promot e r. - uwuC/DE e^es— 

•*»3fce c o mC gene i s 3cp^ted^em^ gw ^g eae 4 > y443 Weoti<ies ^ 



Streptococcus mutans 
ComCDE Operon 




Express Mail No. EL818623952US 
Inventor(s): Cvitkovitch et al 
AttyDktNo.: 1889-00401 
Contact: CM. Mmtz (713)238-800.0-— * 
Sheet 2 of 19 



Figure 2 

-nSeqw ne eb of the open re ading frame s oncoding - the signal p e ptid e prccursor - (€o t nC) ? t h cr 
^listidine-tebasc (CoinD), and the response rcgalato r (Com Efc 



* X _C , mutaxia cpmC_ ,geae_. _ _ _ 

Encodes a precursor to a signal peptide 
[ATGAAAAAAACACTATCM 
AGATTATCATTGGCGGA(AGCGGAAGCCTATCAA 
TTTTACACAAGCTTTGGGAAAA) ] TAA 



-fflUtana-SSP en co din g s egaencs-^ 
Competenc e- Signal P e ptid e - 
AGCGGAAGCCTATCAACATTTTTCCGGCTGTTTAA^ 
AAAA tSEQ ID NO si] 



r-E&oedes- a pro Lain that functions as a liiafcidino kina se— 
-rocepfeea? — 

[ATGAATGAAGCCTTAATGATACTTTCAAATGGTTTATTAACTTATCTAACCGTTCTA^ 
TTCTCTTGTTTCTATTTTCTAAGGTAAGT^ 
CTTTTTTCGATAAGCAATTTTCTG 

GTTTTATCCTGCAGAGCCTCTTTATTTTATAGCtttATCAATTTATCTTAATAGACAGA 

ATAGTCTTTCTCTAAATATATTTTATGGTCTGCTGCCTGTTGCCAGTTCTGACTTGTT^ 

AGGCGGGCAATCATATTCTTTATCTTGGATGGAACTCAAGGAATTGTAATGGGCAGTAG 

QlTTATAACCACCTATATGATCGAGTTTGCAGGAATAGCGCTAAGTTACCTCTTTCTCA 

GTGTGTTCAATGTTGATATTGGTCGACTTAAAGATAG^^ 

AAACGCTTGATTCCAATGAATATTACTATGCTTCTATACTACCTTTT^ 

GTATGTTATAGAGAGTTATAATGTGATACCGACTTTAAAATTTCGTAAATTTGTCGTTA 

TTGTCTATCTTATTTTATTTTTGATTCTGATGTCATTTTTAAGCCAATAT 

AAGGTTCAAAATGAGATAATGGCACAAAAGGAAGCT^ 

TAGTCAGCAAATAGAATCTCTTTACAAGGATATTCGAAGTTTCCGCCATGATTATCT^ 

ATATTTTAACTAGCCTCAGATTAGGCATTGAAAATAAAGATTTAGCTAGTATTGAAAAG 

ATTTACCATCAAATCTTAGAAAAAACAGGACATCAATTGCAGGATACCCGTTATAATAT 

CGGCCATCTAGCTAATATTCAAAACGATGCTGTCAAGG^^ 

TAGAAGCTCAGAATAAAAAGATTGCTGTCAATGTAG^ 

CCTGAGATGGAGTTGCTTGATTTCATTACCATACTTTCTATCTTGTG 

TGAGGCTGCTTTCGAATCATTAAATCCTGAAATTCAGTTAGCCTTTTTTAA 

GCAGTATAGTCTTTATC^TTCAGAATTCCaCCAAAGAAAAACAAATAGATGTGAGTAAA 

ATTTTTAAAGAAAACTATTCCACTAAAGGCTCCAATCGCGGTATTGGTTTAG 

GAATC^TATTCTTGAACATTATCCCAAAACCAGT^ 

TATTCAAGCAACTCCTAATAATAAAA1 TAG 



T > B. mutans -goroB gena- . 
■ Bncod ^B-a r es p ong&^^e gu l at o r that- activates transcription 
of- a number of - genes — 
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TTTTi 



GTATTGGAAGATGATTTTTTACAA.CAAGGACGT CTTGAAACCA 



3GTCCCTT1 
TTTATACTT 
3ATAAGAGA 



SCCTTTATi 



TGATGAACGGATTGAATTCTACGGCAGTATGACTGACATTGTTAAAA^ TA 



AACAAAACTGAGAGCTGTGAITGCTGATCAflAGGAGAGCAAAA 



]TQA 
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Figure 3 

-4ks-a mino ac i d o oquenoes of the signal peptid e precursor (ComC), the his l idme k Ji me 
(ComD), and th e response regu la tor (C o mE )*— 

S- tt mfcans come protein (CSP Precursor) - ... - 

" MKKTLSLKNDFKE IKTOELEI IIGGSGSLSTFFRLFNRSPTQALGK 




■S-r^mafca a c CoroD pr o tein "(Hiatldin o Kinase) 



MNEALMIIaSNGIiLTYLTVLFLLPLFSK7S1^ 

FYPAEPLYFIALSIYXJ^ONSLSLN^^ 

IITTYMIEFAGIALSYLFLSVFNVDIGRLKDSL^^ 

YVIESY1^IPTLK?RKPVVIVYLILPLIL 

SQQIBSLYKDIMFJfflD^ 

GHLMTI QMDAVKG I LS AKILEAQNKKIAVHVEVS SKIQLPEMELLDF I T ILS ILCDMAI 

EAAFESLWPEIQLAFFKKBJGSITO^ 

NHILEHYPKTSLQTSNHHHLFKQLLIIK 



m ^ G. routaas CamE - protein (Regpon c e Regulator ^ 
MISIFVLEDDFLQQGRLETTIGtf^ 

DXEIKKEEKKGLEV^QIRQHNPSAVIVFVTTHSEFMPLTFQ 
FSHRIESALYYAMENSQKNGQSE 
DERIEFYGSMTDIVKMDKRLFQC^ 
TKLRAVIADQRRAK 
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Figure 4 

— Th e d e duced amino acid -se qu e nce of the signa l-pep tid e pre es rsor in various a lra ina and ~ 
- -^jteyB ^eterf- eiflflv^ffl--ffltft- TV ftricinnl p?pt idir i n OT prf fl fi fld n 16 nmino acid pep t id e 

. that itt flayed qftpr-flw. glyci'nfVglye m a Tfcfi i riiing tn, mt o an Q^ijy* fi jfr n fll prfptidf: ■ 



BM71 CSP 1 MKKTPSLKTOFKEIKTDELEIIIGGSGSLSTFFRLFIIRSFTQALGIC 46 

GBX4 CSP 1 MKKXIiSIiK^FI^ZKTDBLEXXIGGSGSLSTFFRLFin^FTQALCT 46 

H7 CSP 1 MKKTTLSLKNDFKSICTDELEIIIGGSGSLST^ 46 

JH1005 CSP 1 MKKT1^LKN33FKEIKTDELEIIIGCSGTI*STFFIU*F1^ 43 

LT11 CSP 1 MKKTIjSLKiroFKBIKTDEIiBIIIGGSGSI^ 46 

STG8 CSP 1 MKOTLSIiKNDFKEIETDSI^a IXIGGSGSLSTFFIiLFlffaS PTQALGK 46 

CSP 1 MKKTLSI*KirDF2G2IKTDEIiEIIIGGS^ 46 
***+ ************************ _ *»«*******•****-«- 

consensus: 1 MKOTLSLKSroFKEiraDELSXIIGG SGSLSTFFRLPNRS FTQALGK 46 

predicted cleavage alter A 
Figure 5 

— ^Ehe-nyntlictio signal peptide that ip effective at inducing competence, bioXUin foimalluii 
■ -a nd a cid tolerance in Streptococc u s mut ans-. — - 



SGSLSTFFELFMRS FTQALGK ( [SEQ ID NO 



(tSJSQ ID NoT^) 
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Figure 6 

_JIhe. natural act ivity of the sign a l/r ec eptor system functioning in vitro in m o d e l biufilffl s ' 
, — as detgmiin c&fe y t h e abili t y of various strains of S. mutaiu to accept donor plasroid DN Ar~ 
^^.Coafcmnfi e rythromyc i n reairtane er " 



Genetic Transformation in S. mutens Bioftlms 
o.ot 




UAB159 



JH1005 
Strains 



GB-U 
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Figure 7 

^■i nduction of gen e tic tetms f orrnafion in Streptococ c us mutatis 
- fey s ynthetic com p e t ence stimulating p e ptide (SCSP) 1 



Strain 


Peptide added No peptide 
Number of Number of 
Transformants/Recipients Transformants/Recipients 


UAB15 
JH1005 2 


4.65 X10" 1 1.78X10* 
6.98 X10' 2 0 



The final concentration of SCSP used was 500 ng/ml. 
The strain contains a nonsense mutation in the comC gene encoding the CSP, 



Mail No. EL818623952US 
Inventor(s): Cvitkovitch et al 
AttyDktNo.: 1889-00401 
Contact: CM. Mintz (713)238-8000 
Sheet 8 of 19 



Figure 8 

- L ia t of th c^rimers ^ed to amplify th o gft n ftn o r int e rnal ro giu ua of fl i c t j irgct genes-fey- 
e — P 0 ^ ^ 0 chdiu i c ac tio n (PCR) for subcoquont sequencing o i iuac tivatr^-1 - 

ComC region 

ComC Primer Pain F5-B5 

[F5] 23406-23424 5 - AGTTTTTrGTCTGGCTGCQ -3* 

19 nt forward primer 
pctG+C: 47.4 Tm: 503 

tB5] 240S6-24037 5'- TCCACTAAAGGCTCCAATCG -3' 

20 nt backward primer 
pcrG+C: 50.0 Tm: 51.9 

651 nt product for F5-B5 pair (23406-24056) 
Optimal annealing temp: 50.3 
pctG+C; 30.9 Tm; 71.5 

ComD region. 

ComD Primer Pain Fl-Bl 

[Fl] 392-415 5*- CGCTAAGTrACCTCTTTCTCAGTG -3 T 
24 nt forward primer 
pctG+C 45.8 Tm: 51.6 

[Bl] 683-663 5'- GCTTCCTnTGTGCCATTATC -3* 

21 nt backward primer 
pctG+C: 42.9 Tm: 50.8 

292 nt product for Fl-Bl pair (392-683) 
Optima] annealing temp: 49 .5 
pctG+C; 30,8 Tm: 70.2 

ComE region 

ComE Primer Pain Fl-Bl 



[Fl] 145-165 5 - CCTGAAAAGGGCAATCA.CCAG -3' 

21 nt forward primer 
pctG+C; 52.4 Tm: 55.9 

[Bl] 606-585 5 - GCGATGGCACTGAAAAAGTCTC -3 1 

22 nt backward primer 
pctG+C 50.0 Tm; 55 A 

462 nt product for Fl-Bl pair (145-606) 
Optimal annealing tetnp: 53.6 
pctG+G 2&3 Tm 74.1 
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Figare9 

^Ge mCDE local rogion. The ComC (first high l ighte d r egi on ; Bucl co t idre 1 01 to 24 1)r~ 
^ComD-(s econd highlig hted rcg^ H i uelc o tld ft A 383 to 170B) ami C uu iE (diud highlig hte d 
^legioa H i ucl cot ides 1705 t o 2 4 57) p rotcias af e^ ghlighted. — 

- Sequence Range* 1 to "2557 

ID 20 30 40 50 

ACkTTATGTGTCCTAAGGAAAATA^ 
XGXAAJTACACACGATTCCTTTTAX^rGAAAAAGT^ 

<K K !• P I W S K 



60 70 80 90 100 

TTTTCATAAAAAATAGTATACTAATTI^^ 
AAAAGTATTTTTTftXGRTATGATTAATATXAGTTTTTTC 
<KMPFLISlIItiF LIiYli 



110 120 130 140 150 

ATGAAA&AAAG&CXATCATT^ 




ORF RF[2] : > 



«XFFVSDNFFSXXiSI&VS 



160 170 180 190 200 

TGAATTAGAX&TTAXCAlTGft 
ACTTAATCTCTAATAGTAACCrc^ 

ORF &F[2] > 

<$ U S 1 I M 



310 220 230 240 250 

GSCTGTOXAACAGAAQTT^ 

CCGACAAATTGTCTrTCAAAATGTGTTCGAAACCC^ 

ORF RF(2] > 

260 270 280 290 300 

acattggaataaaao^ggctggatttmtattc 

TOTAACCTTAXTTTGTTCCG^CTAAAT^ 

310 320 330 340 350 

aaaataaaaatacagggttaaataatcaagtgtgctctcgtggatgagaa 
tttxatcxttatgtcccaatttattagttcacacgaczagcacctactctt 

360 370 380 390 400 

GATAAAACTATCTCTlAGAGAATAGGCCTCCTCTATTTTATTATTAGGAG 
CTATTTTGATAGAGAATCTCTTATCCGGAGGAGAT^ 

< ORF RP[ 



410 420 430 440 450 
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AACGAACTTATTTACTACTACTAACGAAC^^ 

c ORF RPI4] C 



400 470 430 490 500 
AATGTTCAAGAATATGATTCACCTTTGCTAAACC^ 
CTACAAGXTCTTATACTAAGTGGAAACGATTTCGT^ 
<KBLlHffVXALGlGRSS 
< ORF RF [41 C 

510 520 530 540 550 

CCTTTAGrGGAATAGTTTTCTTTAAAAATTT^ 
G<»UUVrCACCTTATCAA&AGAAAm 

< ORF RF[4j C 

560 570 5B0 530 600 

TTCTTTOGTGGAATTCTGAATGATAAAGAdTATACTC 
AAGAAACCACCTTAAGACTTACTATTTCTCATATGACGGTAAAA AGAAT T 

< ORF RF[4] C 



610 620 630 640 650 

AAAAGGCTAACTGAATTTCaGGATTTAATG^ 
TTOT CCGATTGACT'IAAAGTCCIAftAlITACTAAGCTTO 

< ORF K?[4l C 



$60 670 680 690 700 

GCATTATC^CACAAGATAGAAAGTArGGTAATGAAATCAAGCAACTC CAT 
CGTAATAGTGTGTTCTATCTTTCATACCATOACTTTaGT^ 

ALSHKISSMVMKSSNS I> 
ORF RF [3] * 

< ; ORF RF[4] C 



710 720 730 740 750 

CTCACGCAGTTG1ATTTTACTTGAGACTTCT 

GAGTCCGXCAAC^TAAAATGAACTCWAAGATGTAACTGTCGT13\GAAAA 

SGSCXLX»ETSTXiTAX7> 
' ORF Rfr[3] > 



ORF RF[4] C 



750 770 780 790 800 

TATTCTGAGCTTCTAAGATTTTTGCTGACAAGATA^ 
ATAAGACXCGAAGATTC1!AAAAACGACTGTXCTATGGGAACTGTCGTAG 
L F> 

< ORF RF[4] C 



810 820 830 840 850 

ttttgaatattagctagatggccgatattataacgggtatcctgcaattc 

AAAACTTATAATCGATCTACCGGCTATAATATTGCCCATAGGACGTTAAC 
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800 870 880 690 $00 

ATGTCCTGTTITTTCTAAGATTTGATGGTAAATC^ 
mCAGGAC2kAAAAAGATTCTARACTACCAT^ 

< ORF RF[4] C 

910 920 930 940 £50 

AATCTTTATTTTCAATCC^ 



9ff0 970 $80 590 1000 

TCATGQOTQAAACTTCGAATATCCTTGXRAAGAGATTCTAlTTGCTGACT 
^TACCGCCTTTGAAGCTTATAGGJACATCT 

KAEVS39rXLVKRF7IiLT> 

< ORF RF[4J C . 



1010 1020 1030. 1040 10S0 

ATACTGGGTGATATTTCCy^ 
TAT^CCCACTATAAAGCTTAGACTCGAAC^^ 
XLGDISNXiSFLLC HYL I> 

> 

< _ORF RF[4] C 

1060 1070 1080 1090 1100 

TXTGAAGCTTTTGTTTGGTATATTC^ 
AAACTTGGAAJACAAACCATAXAACCGA^^ 

< ORF RF[4] C 

1110 1120 1130 1140 115 0 

AATAAAATAAGATAGACAATAACto 
TrATTTTATTCTftTCTGTTATTGCTGTTTAAATGCTTT^ 

< ORF RF[4] C 

1160 1170 1180 1190 1200 

tatcacattataactctctataagatacaatacctgtattaaaaggtagt 
atagtgtaatattgagagatattgtatgtt^ 

< ' " O RF M 141 C 

1210 1220 1230 1240 1250 

axagaagcatagtaatattcattggaatcaa^ 
tatcttcgtatcatt&taagtaac cttagttcgcaaaaaactggaagtag 

< ORF RF[4] C 



Tn 
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1260 1270 1280 1290 1300 

TTGGTCAAACTATCTTTAACmMACCA^^ 




1310 1320 1330 1340 1350 

AAAGAGGTAACTTAGCG CTATTCCTGCA^CTCGATCATACl^CSC^GGTTA 
irrrCTCCATTGAATCGCGATAAGGACGTTTGAGCTAGTATAT 

< ORF RF[4] C m ' W " ' 

1360 1370 1380 1390 1400 

TAATGCTACTGCCCACTACAATTCCT 
AITACGATG^GGGTAATGTTAAGGA 

< m orp Ry [4] c " 

<L B K L E M W S 5 S Y 
< 

1410 1420 1430 1440 1450 

ATC^TTGCCCGCCTAAACAAGTCAGAACT 

TACTAACCGOCGGATTTGTXCflfiTCTTGACCGTTGTCCGTCGTCTGGTAT 

< — — " • — b RF"RPl4T c^ ^ 

<SQGGIjCTLV p l l c c v~m 

< 

1460 1470 1480 1490 1500 

aaatat&tttagagaa&gactattctgtct^ 



_ORF RF[4] C 



1510 1520 1530 1540 1550 

CTATAAAATAAAGAGGCTCTGCAGGATAAAACAGG 



W RF[4] C 



1560 1570 1580 1590 1600 

GTAACAGCAATCJLTTATGAGAAAATTGCT^^ 



_ORF RP[4) C 



1610 1^20 1630 1S40 1650 

CTTTTTCGAXAAAGTGACATTACrrACCTTA^ 



_ORS RF[4] C 



1660 1670 1*80 xeso 1700 

AEAfcAACGGTttU^TAAGnT^ 



_ORF RP[4] C 
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1710 1720 1750 1740 1750 

^CATTCATTTTGCTCTCCT 
AGTAAGTAAAACGAGAGGAAACTAGTCGTTAGTGTC^ 

< ORF RP [5] C 



1760 1770 1780 1790 1600 

< " _orf RF[S] c 

1810 1820 1830 1640 1950 

GCGTTTTTTCCG&TGAATACGGGTAATATTGGCA^ 
CGCAAAAAAGGCTAGTT&TGCCCATTATA^ 

< ORF RFI51 c 

1850 1870 1B80 1890 1900 

AGCGATGGCACTGAAAAACTCTCT 
TCGCTACCGTGACTTTTTCAGAGAATAGGT^ 

MAIi K K S L I H F N N V S H> 

ORF RP[1] > 

<_ ORF RF 15] C 



1910 1920 1930 1940 1950 
CTGCCGTAGAATOCJ^TCCGTTCAT 
GACGG CATCTTAAGlTAGGC^AGTAGTATTCAl^^ 
TAVEFNPFI IS I K A E L M> 
O RF RF[1] _ ; > 



< ORF RF [51 C 

<A T SffLGffMMIi I FAS S X 
< ORF RF[6] C 



1960 1970 1980 1990 2000 



ggctgttgaagatgtttcaaaatacagaaxctca 
ccgacaacttctacaaagttttatgtcttagagt 

6 C> 

< ORF RF[5] * C " ' i 

<PQQLHKIiICFRIiIiI»SRF 
< ORF RF[6] C 

2010 2020 2030 2040 2050 

ACTGACTlTCAGATGAATGGA^ 

TGACTCAAAGTCTACTTACCTTTTATl^l'TTCAAGGAGACTAACTGGTAAG 

- < ORF RF[5] " £ "" 

<SLKLHlSFLFa-RQ5TVMR 
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ORF HP [6] C 



2060 2070 2030 2090 2X00 

TTCTGGCTGTTTTCCATAGCAT5UTA 



. <R A T K W L M 
< ORP RF [6j C 



ORF RF[5] C 



2X10 2X20 2X30 2X40 2X50 

GAACTC CTCAGGATTCAAAGATTTATCAATAAAATC^^ 
CTTGAGGAGTCCTAAGTTTCTAAAT&GIUAIT^^^ 

< ORF RF[5] C 

2X60 2X70 2X60 2X90 2200 

GATACTGAAAAGO^GGGGGATAAACtCAGAAT 
CTATGACTTTTCACTCCCCGTATTTC 

H C R D K D> 



■ ORF RF[5] C 



22X0 2220 2230 2240 2250 

ATAACTGCACTAGGATTATGCTGTCTJAT 
TATTCACGTGATCCTAATACGAC^GATO 

MNCTRIMLSNIilGYFQS> 
< 4 ORF RP[5] C 



2260 2270 2280 2290 2300 

TTTCTTTOCCTCTTTTTTGA^ 

AAAGAAAAGGAGAAAAAACTAAAGTTATAGGTXTrTCTTTTAGACCACTA 
FLFI.FFDF jriQKBNIi VI> 

> 

< ORF RF[S] C 

23X0 2320 2330 2340 2350 

TCCCCrrriVAGGGATAGCGTCAAT^ 
AttSGGAAAAGTrcCTATtXCAGTTA^ 

AIiFRDSVUKLLWFSK N> 

> 

< " * ORF RFtS]' C 



2360 2370 2380 2390 2400 

CTCAATTCTTTATAAGACCAATTTTTTTCTTT 
C^rTAAGAAATATTCTGGTTA^^ 

S QF F I RP X FF F H D 6 C K G> 



M I A A M V> 
> 



< ORP RF|51 C 
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2410 2420 2420 2440 2450 

GGTTTCAAGACGTCCTTGTTGTAAAAAATCAT CTTCCAATACAAAAATAG 
COUL&GTTCTGCAGC^ 
G P K T S L L> 

> _ 
K 



V B R R P C C 



KSSSN*KI> 



< OR? RP[5] C 



2460 2470 2430 2490 2500 

AAAXC&TTATTTCTCCTTT3&TC2TC 

BIIISPL IFYLG* 




2510 2520 2530 2540 2550 

ATACACAGAAAACTCTATAAAAC^ 
TATGTGTCTTTTCCATAIO^^ 

AATAACC 
TTATTGO 
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Figure 10 

■foefad e d both u pst re a m an d d o wn s tr e a m- (p re moter Is upstream): 



c . mufcano coraX gene ■ 
ATGGAAGAAGATTTTGMlATTGTTTO 
CCGTTATTACTTTATTAAAATGTGGACTCGTGAAGAT^ 
TTTTGCACCAATTATTAAG<MAACATCC^^ 

ATCTATTTTAAGACACGTTTTTCTAATTACATTAAAGATGTTTTGCGTCAGCAAGAAAjG 

TCajG&AACGTCGTTTTAATAGAATGTCTTATGAAG^^ 

TGTC1AAGTGGCGGTATGCAATTGGATGAATATATTTTATTTCG 

tataaaoiaggtctgagtactgaaaagcaagagctgtttgagcgcttggtagcaggaga 
gcactttttgggaaggcaaagtatgctgaaagatttacgtaaaaaattaagtgatttta 



mutans ConflE -protein 
MEEDFEIVFNKVKPIVWKLSRYYFI 
IYFKTRFSFZIKDVLRQQESQKRRPHRMSYEH^ 
YKQGLSTEKQELFERLVAGEHFLGRQSMLIODLRKKLSDFKEK 

. ututana cu n tx geno l o cal r e gi o n 
GTAAATAAAACAGCCAGTTAAGATGGGACATTTATGTCCTGTTCTTAAflJGTCTTTTTCG 
TTTTATJU^TAATTTTATTATAAAAGGAGGTCATCGTAATAGATGGAAGAAGATTTTGAA 
ATTGTTTTTAATAAGGTTAAGCCAATTGTATGGAAATTAAGCCGTTATTACTTTATTAA 
AATGTGGACTCGTGAAGATTGGCAACAAGAGGGAATGTTGATTTTGCACCAATTATTAA 
GGGAACATCCAGAATTAGAAGAGGATGATACAAAATTGTATATCTATTTTAAGACACGT 
TTTTCTAATTACATTAAAGATGTTTTGCGTCAGCAAGAAAGTCAGAAACGTCGTTTTAA 
TAGAATGTCTTATGAAGAAGTCGGTGAGATTGAACACTGTTTGTCAAGTGGCGGTATGC 
AATTGGATGAATATATTTXATTTCGTGATAGTTtTGCTTGCATATAAACAAGGTCTGAGT 
ACTGAAAAGCAAGAGCTGTTTGAGCGCTTGGTAGCAGGAGAGCACTTTTTGGGAAGGCA 
AAGTATGCTGAAAGATTTACGTAAAAAATTAAGTGATTTTAAGGAAAAATAGTTAAAAA 
GGGAAAGAATGGAACATGTGATTGTACCATTCTTTTTGGTTGAAAATTAAGAAAAGTTA 
TTATAAATTATTGGTTTAACATGCCATATTA 
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Figure 11 

^fte-ee niA and cu ii iB nucl e otide and amino a nd s p qi i encoc. ComA ond ComB aro-thg- 
components of - tho - CSP cxpor terr"~ 

S F ~ c - mutanb oorcft" gen e^ " ~ ~ " 

ATGAAACAAGTTATTTATG 

TATCAAAAGAGTAACAAAAAGGGGAGGGACAGTTTC 

GGCAGTCTAAGTTGTTTTGGCGCAGACATTATAAGCTAGTACCTCAGATTGATACCAGA 
GACTGTGGGCCGGCAGTGCTGGCATCTGTTGCAAAGCATTACGGATCTAA 
CGCTTATCTGCGGGAACTCTCAAAGACTAACAAGCAGGG^ 
TTGAAGCTGCTAAAAAGTTAGGCTTO^ 

TTTGATTATAATGATTTGACCTATCCTTTTATCGTCCATGTGATTAAAGGAAAACGTCT 

GCAGCATTATTATGTCGTCTATGGC^GCCAGAATAAT^ 

ATCCTTCRGTTAAGGTGACTAGGATGAGTAAGGA^^ 

CTTGCAATTTTCCTAGCTCCTCLAGCCTAACTATAAGCCTCATA^ 

TTTGTCTAATTTCTTCCCGTTGATCTTTAAGC^ 

TAGCTAGCTTGATTGTGACGCTCATTGATATTGTCGGATCATACTATCTCCAAGGAATA 

TTGGACGAGTACATTCCTGATCAGCTGATTTCAACTTTAGGAATGATTACGATTGGTCT 

GATAATAACCTATATTATCCAGCAGGTCATGGCTTTTGC^UUVAGAATACCTCTTGGCCG 

TACTCAGTTTGCGTTTAGTCATT 

CTTCCTATGTCTTTCTTTGCGACAAGGCGAA 

TGCCAATCAGATTATTGATGCTGTAGCGTCAACCATCTTTTCAATCTTTT^ 

CTATGGTAATTTTGGTTGGTGGGGTT^ 

ACCTTGCTCTCCATTCCGATTTATGCCATCAT^^ 

GAAAATGAATCACGAAGTGATGGAAAGCAATGCTGTGGTAAGTTCTTCTATCAT^ 
ATATCAATGGGATGGAAACCATTAAAT 

ATTGATAGTGAATTTGTTGATTATTTGGAGAAAAACTTTAAGCTACACAAGTATAGTGC 
C^TTCAAACCGCATTAAAAAGCGGTGCTAAGCTTATCCTC^TGTTGTCATTCTCTGGT 
ATGGCTCTCGTCTAGTTATGGATAATAAAATCTCAGTTGGTCAGCTTATCACCTTTAAX 
GCTTTGCTGTCTTATTTCTCA^ 

GCAGTCAGCTCGCGTTGCCAATACACGTCTTAATGAGGTCTATCTTGTCGAATCTGAAT 
TTGAAAAAGACGGCGATTTATCaGAAAATAGCTTTTTAGATGGTGATAraTCGTTTG^ 
AATCTTTCTTATAAATATGGATTTGGGCGAGATACCTTATCAGATATTAATTTATCAAT 
CAAAAAAGGCTCCAAGGTCAGTCTAGTTGGAGCCAGT^ 
CTAAACTGATTGTCAATTTCTACGAGCCTAACAAGGGGATTGTTCGA^ 
GATTTAAAAGTTATTGATAAGACAGCTTTGCGGCGGCATATTAGCTATTTGCCGCAACA 
GGCCTATGTTTTTAGTGGCTCTATTATGGATAATCTCGTTTTAGGAGCTAAAGAAGGAA 
CGAGTCAGGAAGACATTATTCGTGCTTGTGAAATTGCTGAAATCCGCTCGGACATTGAA 
CAAATGCCTCAGGGCTATCAG^CAGAGTTATCAGATGGTGCCGGTATTTCTGGCGGTCA 
AAAACAGCGGATTGCTTTAGCTAGGGCCTTATTAACACAGGCACCGGTTTTGATTCTGG 
ATGAAGCCACCAGCAGTCTTGATATTT 
CAGATGACGGAGAAAAG^TAATTTCTGT^ 
TGACGAAGTCATTGTCATGGATCAGGGA 
TTTTAGCTAAGCAAGGTTTCTATTATAAC CTGTTTAAT 
^ — ■ — S > tmitana - CumA piuLeii t- 
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iVIYWL I VI AVNILLE 1 1 KRVTKRGGTVS S SNPLPDGQS KLFWERHYKLVPQ IDTR 
DCGPAVLASVAKHYGSNYS IAYLRELS KTNKQGTTALGIVEAAKKLGFETRS IKADMTL 
FDYNDLT YP F I VHVI KGKRLQHYYWYGS QNNQLI I GD PD P S VKVTRMS KERFQS EWTG 
IiAIFLAPQPNYKPHKGm(NGLSOTFPLIFKQKMJ«TYIIIASLIVTLIDIV 
IJJEYIPDQLISTLGMlTIGLIITYIIQQVMAPAKEmAVLSLRLVIDVILSYIKHIPT 
LPMSFFATI»TGEITSPJ?1I1ANQIII^ 

TLLSIPIYAIIIFAFLXPFEKMNHEVMESNAVVSSS IIEDINGMETTKSLTSESARYQN 
IDSEFVD YLEKNFIQiHKYSAXQTAIiKS GAKLIIiNWILWYGS RLVMDNKI SVGQLITFN 
ALLSYFSNPIENIINLQSXLQSARVAlTC'RIdTSV^ 

NLSYKYGFGIU5TLSDimSIKKGSKVSLVGASGSGIOTTIAKLIWFYEPlTO3ITOrNG^ 
DLKVIDKTALRRHISYLPQQAYVFSGSIMDNLVLGAKEGTiSQEDIIRACEIAEIRSDIE 
QMPQGYQTELSDGAGI SGGQKQR I ALARALLTQ APVL I LDEATS SLD I LTE KKI I SWLL 
QMTEKTI IFVAHRLSISQRTDEVIVMDQGKIVEQGTHKELI1AKQGFYYNI1FN' 

ST _ 

^ — 6. mutano gotoB g e n e 
ATGGATCCTAAATTTTTACAAAGTGC^^ 
GACaCTATTAATTGTTCCTTTGGTC^ 
TTGCTAAAAAAGA^TTACAGTGATTTCTACT 

GATGTTATCCAATCTTACAGTGACAGTTCAATCATTAAAAATAATTTAGATAATAATGC 
AGCTGTTGAGAAGGGAGACGTTTTAATTGAATATTCAGAA7A 
AGACTGAACAAAAGAATATTATAAAAGAAAG 
AAACACCAAAAGAGCAAGAAAAAGAAGAAGTCTAAGAGCAAGA 

GAA2iAAGAAATCGAAAGACAAGGAAAGCAGCTCTGACGATGAAAATGAGACAAAAAAGG 
TTTCGATTTTTGCTTC^AAGATGGTA^ 

AATATTATTCCGAAGCAAACCGAGATTGCTCAAATCTATCCTGATATT^ 

AAAAGTGTTAATCACCTATTATGCTTCITCTGATGATGTTGTTTCTATGAAAAAGGGGC 

AAACCGCTCGTCTrrCCTTGGAAAAAAAGGGA^TGACAAGGTTGTTATTGAAGG^ 

ATTAACAATGTCGCTTCATCAGCAACTACTACTAAAAAAGGAAAT 

TGCCAAAGTAAAGGTTTCTAAGAAAAAT^ 

AGACAGTCACTGTCATTGATAAAAAGACTTATTTTGATTATTTCAAAGATAAATTACTG 
.TAAAATGGATAAT 

S. tauta as C o mB p - rctein 
MDPKFLQSAEFYIU^YHNFATLLIVPLVCLIIFLVIFLCF 
DVIQSYSD^IlKWNLDmAAVEKGDVL 
KHQKSKKICKKSKSKKASKDKKKKSKDK^ 
KTIIPKQTEIAQIYPDIQKTRKVLITYYASSDDWSM^ 

IKNVAS S ATTTKKG3STLPKVTAKVKVSKKNS KL IKYGMTGKTVTVIDKKTYFDYFKDKLL 
HKMDN 
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Figure 12 



